rank | frequency | n-gram |
---|---|---|
1 | 254074 | -s |
2 | 106716 | -a |
3 | 92136 | -u |
4 | 88785 | -i |
5 | 84066 | -m |
rank | frequency | n-gram |
---|---|---|
1 | 59167 | -as |
2 | 34985 | -es |
3 | 28411 | -em |
4 | 25815 | -is |
5 | 23668 | -am |
rank | frequency | n-gram |
---|---|---|
1 | 27592 | -iem |
2 | 19488 | -ies |
3 | 10644 | -jas |
4 | 9746 | -nas |
5 | 9597 | -ais |
rank | frequency | n-gram |
---|---|---|
1 | 8248 | -ties |
2 | 7915 | -jiem |
3 | 6834 | -anas |
4 | 6523 | -ijas |
5 | 4382 | -šana |
rank | frequency | n-gram |
---|---|---|
1 | 5525 | -ajiem |
2 | 5166 | -šanas |
3 | 2805 | -oties |
4 | 2656 | -cijas |
5 | 2482 | -šanai |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings